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ABSTRACT 

The first crystal structure of human telomeric DNA 
in complex with the natural alkaloid berberine, 
produced by different plant families and used in 
folk medicine for millennia, was solved by X-ray dif- 
fraction method. The G-quadruplex unit features 
all-parallel strands. The overall folding assumed by 
DNA is the same found in previously reported crystal 
structures. Similarly to previously reported struc- 
tures the ligand molecules were found to be 
stacked onto the external & and 3 -end G-tetrads. 
However, the present crystal structure highlighted 
for the first time, the presence of two berberine mol- 
ecules in the two binding sites, directly interacting 
with each tetrad. As a consequence, our structural 
data point out a 2:1 ligand to G-tetrad molar ratio, 
which has never been reported before in a telomeric 
intramolecular quadruplex structure. 



INTRODUCTION 

The 3'-end single-stranded overhang of human telomeric 
DNA is currently considered an interesting target in new 
anticancer therapy strategies. This is due to its ability to 
form, in the presence of stabilizing agents, such as mono- 
valent cations, non-canonical G-quadruplex folds 
(Scheme la), which in their turn, can inhibit the activity 
of telomerase. This enzyme is involved in the cell life cycle 
and is over-expressed in most of the cancer cells. This 
imbalance has an impact on the recruitment of telomere- 
associated proteins required for the capping and mainten- 
ance of telomeric ends (1-3). 



Many efforts have recently been devoted to the devel- 
opment of new G-quadruplex stabilizing molecules as po- 
tential anticancer drugs. Compounds found to be active in 
the binding to the telomeric G-quadruplex include anthra- 
quinones, acridines, quinacridines, porphyrins, perylenes 
and the natural macrocyclic molecule telomestatin (3,4). 
Most of them are charged positively and contain a large 
aromatic surface. 

The nuclear magnetic resonance and X-ray structures of 
a few telomeric quadruplex-small molecule complexes in 
the presence of K + ions have already been reported and 
studied (5-11), and these all feature a quadruplex parallel 
stranded topology (12). In almost all of these cases, only 
one molecule of ligand was observed stacked onto an 
external G-tetrad (5,6,9,11). Isoquinoline alkaloids, such 
as berberine (Ber) (Scheme lb), possess structural features 
similar to those mentioned earlier, and constitute an im- 
portant class of natural products, produced by different 
plant families, and used in Ayurvedic, Chinese and 
Middle-Eastern folk medicine for millennia. Moreover, 
they possess striking biological and pharmacological 
activities, and have an emerging role in contemporary bio- 
medical research. In the past few years, several papers 
have been published discussing the binding properties of 
the natural benzophenantridine alkaloids berberine and 
berberine-derivatives toward G-quadruplex structures, 
illustrating again that the properties and behavior of 
these compounds are becoming a key area of study and 
interest (13-18). 

Biophysical and biochemical studies have shown that 
these substances bind to G-quadruplex structures formed 
by the human telomeric, the c-myc oncogene and other 
biologically relevant G-rich sequences (19-21). In the 
case of berberine, a stacking interaction with 1 : 1 binding 
stoichiometry has been observed and documented, with a 
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Scheme 1. (a) Schematic representation of three guanine tetrads and 
monovalent cations as found in G-quadruplex telomeric DNA 
(b) Berberine (Ber) molecule drawing. 



high binding affinity for a 22-mer human telomeric 
sequence in K + containing solution (22). Moreover, the 
induction of an antiparallel G-quadruplex in a 21-mer 
human telomeric sequence has been evidenced by 
circular dicroism measurements also without the 
addition of monovalent cations (23). 

Isoquinoline alkaloids show a notable efficiency in tel- 
omerase inhibition and, most importantly, they are also 
selective toward G-quadruplex over duplex DNA (20). 
Furthermore, semi-synthetic derivatives of berberine, es- 
pecially those 9-0 substituted, which contain charged 
aza-aromatic terminal groups, show an even higher speci- 
ficity and have an increased ability over berberine to 
inhibit telomerase, suggesting that they might become 
lead compounds for the development of new anticancer 
drugs (23,24). 

The X-ray diffraction analysis of berberine complexed 
with double helix DNA d(CGTACG) was recently re- 
ported (25). However, to the best of our knowledge, no 
atomic detail structural data have been published up till 
now on the adduct formed by berberine or berberine- 
derivatives with the G-quadruplex. Thus, to shed more 
light on the binding features of this natural alkaloid, we 
solved the structure of the berberine/d[TAG3(T 2 AG3)3] 
complex (referred as Ber/h-Tel23 in the text later). 



MATERIALS AND METHODS 

The d[TAG3(T 2 AG 3 )3] sequence was purchased from Jena 
Bioscience (Jena, Germany), HPLC purified berberine 
chloride was purchased from Sigma- Aldrich Co. Ltd., 
USA. The purified oligonucleotide was dissolved to a 
final concentration of 4mM, in a solution containing 
20 mM potassium cacodylate buffer at pH 6.5 and 
50 mM potassium chloride, and then heated to 368 K for 
5min. It was subsequently allowed to cool overnight to 
room temperature to induce the quadruplex formation. 

A search for crystallization conditions was performed 
using a home-made crystallization screen based on the one 
described by Campbell and Parkinson (26). The DNA 
quadruplex-drug complex was crystallized at 296 K 
using the sitting drop vapour diffusion method. One 
microliter of the 4mM solution of the human telomeric 
sequence d[TAG3(T 2 AG 3 )3] was mixed with 1 |iL of a 
2 mM solution of berberine in water, and with 2 |iL of a 
solution containing the following: 1.8 M ammonium 



sulphate, 0.2 M KC1, 0.05 M Li 2 S0 4 and 0.05 M potas- 
sium cacodylate with pH = 6.5. Drops were equilibrated 
against this latter solution. Yellow hexagonal crystals of 
the complex appeared after 2 weeks. 

Data collection on crystals of the quadruplex-drug 
complex was performed at 100 K, using as cryoprotectant 
a solution analogous to the crystallization solution but 
with an increased concentration of ammonium sulphate 
(1.9 M) and 30% glycerol. The berberine-quadruplex 
complex crystallizes in the space group P6, with unit cell 
parameters a = 65.620 A and c = 41.81 A. 

Data for the complex were collected at the XI 2 
beamline (EMBL, DESY, Hamburg) to a maximum reso- 
lution of 2.4 A, using a wavelength of 1.000 A. Data were 
integrated using the program mosflm (27,28) (and scaled 
with SCALA from the CCP4 program package (29). Unit 
cell parameters and data collection statistics are given in 
Table 1. 

The structure of the complex was solved by molecular 
replacement using the program Molrep (30). Coordinates 
of the complex of the same DNA sequence with a 
tetra-substituted naphthalene diimide (9) (PDB code: 
3CDM) were used as a search model, after deleting 
atoms belonging to ligand and solvent molecules. 

Fo-Fc electron density maps revealed a clear density for 
the drug molecules and they were introduced and refined 
at full occupancy. Two spherical densities were also 
present in the Fo-Fc electron density maps inside the 
G-quadruplex channel that could be interpreted as potas- 
sium ions. The model was refined using the program 
Refmac5 (31) from the CCP4 package (29). Manual re- 
building of the model was performed using the program 
Coot (32). The crystal packing analysis was performed 
with the program Mercury (33). Coordinates of the 
complex have been deposited with the Protein Data 
Bank (PDB access number 3R6R). 



RESULTS 

The complex crystallizes in the P6 space group, with each 
asymmetric unit containing an intramolecular 
G-quadruplex in complex with berberine in a DNA/ 
ligand molar ratio of 1:3. The overall DNA topology is 
of parallel type and quite similar to those described for the 
native 22-mer intramolecular quadruplex (34), and also 
for the h-Tel23 in complex with a tetra-substituted naph- 
thalene diimide (9,1 1). The analysis of the crystal structure 
(Figure 1) shows a parallel orientation of all strands and a 
square planar^ arrangement of the G-tetrads, which were 
observed 3.2 A apart from each other and featuring an 
average 29° twist between each successive step. 
Monovalent K + cations, featuring an antiprismatic coord- 
ination environment, are positioned between adjacent 
G-tetrad, with an average distance of 2.7 A from the 06 
carbonyl atoms of the guanine residues (with maximum 
and minimum bond lengths of 3.0 and 2.7 A, respectively). 
All three loops protrude from the stacked tetrad plat- 
forms, with the adenine in each TTA link intercalated 
between the two thymines. In contrast to the structure 
of h-Tel23 in complex with the naphthalene diimide, no 
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Table 1. Summary of data collection and atomic model refinement 
statistics 



Data collection o 
Wavelength (A) 
Space group 
Cell dimension (A) 
Limiting resolution (A) 
Unique reflections 

-^sym 

Multiplicity 

Completeness overall (%) 
<I/a(l)> 
Refinement 

Resolution range (A) 

Unique reflections, working/free 

^factor (%) 
^free (%) 

RMSD bonds(A) 
RMSD angles (°) 



1.000 
P 6 

a = 65.620, c = 41. 
2.3 (2.42-2.3) 
4674 

0.088 (0.68) 
22.2 (22.3) 
100.0 (100.0) 
24.9 (1.1) 

30.0-2.4 

4128/188 

21.5 

25.2 

0.008 

2.451 



Values in parentheses are for the highest resolution shell. 




Figure 1. DNA topology and binding sites in the Ber/h-Tel23 
adduct (Ber molecules related by C 2 pseudosymmetry are omitted for 
clarity). 



ligand molecules intercalate in the loops. These loops have 
the same distinctive conformation, as has been found in 
the native 22-mer structure (34). The crystal packing 
features dimers of symmetry related (2-fold axis) intramo- 
lecular G-quadruplexes connected by the A2 and T13 
residues belonging to different DNA monomers 
(Figure 1). They are linked by Watson and Crick type 
H-bonds, as found for the structures of the native 
22-mer and the diimide/h-Tel23 complex (9,11,34). As a 
consequence, the two Spends define a binding site where 
two coplanar Ber molecules are stacked, and lying 3.4 A 
apart from both the 5'-end tetrads (Figure 2). Each Ber 
molecule interacts with a pair of adjacent guanines per 
tetrad. As a result of this placement and of the 2-fold 
symmetry relationship featuring the binding site, each 



ligand molecule actually interacts with the same couple 
of guanine from both the tetrads: G3/G9 in one case 
and G15/G21 in the other (Figure 2). 

The 2-fold crystallographic axis, relating the two 
G-quadruplexes, in addition, bisects the Ber molecules, 
that are disordered and possessing the dyad axis only as 
a pseudo-symmetry (Figure 3a). Only the pair formed by 
the combination of Ber molecules with the metoxy groups 
pointing in the same direction is allowed in the case of the 
5'-end binding site. With reference to Figure 3a, the pair 
can be formed either by the two pink or by the two gray 
Ber molecules. Certainly, the ligand pair in the binding site 
cannot be formed by Ber molecules with methoxy groups 
pointing in the opposite direction, due to the short, not 
allowed contacts that would otherwise be formed between 
the molecules belonging to the pair (Figure 3a, dashed 
lines). Interestingly, Ber molecules in each couple 
interact with each other through van der Waals inter- 
actions and define an approximate four-sided polygon, 
which leaves a free central channel, the one occupied by 
the monovalent ions (Figure 3b). It is worth noting that, 
as clearly shown in Figure 3b, the distance between the 
berberine carbon atoms C3 and C10 (9.3 A) almost cor- 
responds to the distance between the N9 nitrogen atoms of 
the guanine residues of the interacting pairs (9.6 A), 
namely G3/G9 and G15/G21. Thus, the isoquinoline 
system of each Ber molecule is very well overlapped on 
the purinic moieties belonging to the G3/G9 and G15/G21 
couples. 

Two further Ber molecules are stacked on the 3 / -end, 
about 3.4 A apart from the tetrad. This binding site is 
completed by the A2-T13 base pair from a different 
G-quadruplex dimer, symmetry related by a 3-fold axis 
(Figure 4a). As well as for the 5'-end binding site, each 
Ber molecule is at stacking distance with only two adjacent 
guanines (G5/G11 and G17/G23, respectively; Figure 4b). 
Also, in this case, the isoquinoline moiety perfectly 
overlaps onto the base pair. However, unlike what 
occurs for the 5'-end binding site, the two stacked ligand 
molecules form a dihedral angle of 19.7°, so maximizing 
the stacking interaction with the 3'-end tetrad, which is 
bent as well, with a dihedral angle of 13.9° between the 
planes defined by the G5/G11 and G17/G23 pairs. 

It is to be noted that the absence of a planarity of the 
3'-end tetrad is a common feature for the intramolecular 
telomeric G-quadruplex and it is present in all the to date 
known crystal structure of both native and complexed 
DNA, where the tetrad is slightly funnel-shaped 
(dihedral angles between adjacent guanines ranging from 
12° to 18°) (9,11,34). Instead, the strong interaction estab- 
lished in the here described structure between the ligand 
molecules and the 3'-end tetrad, makes the guanines in 
each pair (G5/G11 or G17/G23) indeed coplanar 
(dihedral angles falling in the range 5-6°). 

In a similar way to the ligands positioned on the 5'-end, 
the two Ber molecules define a four-sided polygon, but 
here their methoxy groups point in the opposite direction 
(Figure 4c). The molecules are in contact through CH O 
H-bonds (CO distance 2.6 A; Figure 4b). The open 
central channel is again visible, and occupied by a dis- 
ordered water molecule, which lies 3.3 A far from the 
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Figure 2. S'-end binding site for the Ber molecules (the other couple of Ber molecules, related by C 2 pseudo symmetry axis, is omitted for clarity). 




Figure 3. (a) Disordered Ber molecules in the S'-end binding sites. Only 
the pair of molecules featured by the same color is really present in the 
binding site. The position of the crystallographic 2-fold axis, relating 
the disordered Ber molecules, is also shown. Skeleton and Fo-Fc omit 
map for each Ber contoured at 1.5a level, (b) Top view of a couple of 
Ber molecules stacked onto the 5'-end G-tetrads: (the other couple of 
Ber molecules, related by C 2 pseudosymmetry, is omitted for clarity). 



nearest potassium ion. This water molecule forms 
H-bonds with the 06 carbonyl oxygen from the guanine 
residues (O O mean distance 3.0 A) and CH O inter- 
actions (C ' "O mean distance 3.2 A) with the Ber molecules 
contributing to fix their orientation. Instead the symmetry 
featuring the 5'-end binding site, as well as the absence of 
water molecules, could explain the disorder observed for 
the ligands in this binding site. 



DISCUSSION 

The structure of the complex here reported identifies two 
different binding sites, each containing two ligand mol- 
ecules stacked side-by-side onto the tetrad. The overall 
3:1 binding stoichiometry here found differs from the 
ligand to quadruplex molar ratio previously reported in 
literature for the adducts formed in diluted K + solution 
between Ber derivatives and the 22-mer telomeric DNA. 
These complexes exhibit a 1:1 binding stoichiometry 
(19-24) with DNA, which is present in solution as a 
mixture of hybrid 1 and hybrid2 topology. 

The two Ber molecules stacked on the tetrads represent 
a significant novelty when compared with the structural 
data available to date regarding unimolecular 
G-quadruplex formed by telomeric DNA. Indeed, 
a ligand to tetrad ratio different from 1:1 has only been 
previously observed in two cases. The first refers to 
the X-ray structure of tetramolecular adduct 
daunomycin/d(TGGGGT), where three ligand molecules 
on a G- tetrad were found (35), and the second describes 
an acridine derivative adduct with the TERRA RNA (7). 
The latter complex is characterized by a 2:1 ligand to 
G-tetrad molar ratio stoichiometry, but the ligand gives 
rise to a 1:1 molar ratio when complexed with telomeric 
DNA in an all parallel arrangement (7). The stoichiomet- 
ric ratio greater than 1:1 found for the adducts of (7) and 
(35) is, most likely, favored by the folding topology of the 
target and ultimately on how the loops connect the 
tetrads. In these complexes, as well as in our X-ray 
solved complex where G-DNA is in parallel conformation, 
loops connect opposite ends of the quadruplex leaving the 



636 Nucleic Acids Research, 2013, Vol. 41, No. 1 




Figure 4. (a) Binding site for the Ber molecules at the 3'-end G-tetrad; 
(b) H-bond contacts established by the HOH9 water molecule in the 
3'-end binding site; (c) top view of the two Ber molecules stacked on to 
the 3'-end G-tetrad: skeleton and Fo-Fc omit map for each Ber con- 
toured at 1.5a level. 



tetrads completely available for binding. Among the 
foldings reported to date for telomeric intramolecular 
G-quadruplex, the parallel is the only one featuring both 
external tetrads which are not hindered by the adenine and 
thymine residues. Instead, the hybrid type, typical of 
diluted K + containing solutions, presents partially not ac- 
cessible tetrads. Because for some time the parallel 
G-quadruplex has been considered to be exclusively 



distinctive of the solid state and to be induced by the 
crystal packing forces, one could doubt its biological rele- 
vance. However, recent literature has reported that under 
crowding conditions telomeric sequences might co-exist as 
an equilibrium mixture of different kinds of arrangements, 
also including the parallel one (36-38). In addition, while 
the antiparallel G-quadruplex conformation is induced by 
a number of compounds (39), ligands are known which 
are able to induce, in solution, the parallel arrangement of 
telomeric DNA (40). Thus, the question about the most 
likely possible form of the telomeric DNA sequence in vivo 
or at least in experimental settings simulating the in vivo 
condition (41-44), is still strongly debated and researchers 
are far from reaching a convergence of opinions. 
Moreover, it has been demonstrated that when working 
with significantly longer telomeric sequences than those 
normally used in experimental studies (in the range 
21-26 nt), the very same ligand molecule can give rise to 
complexes featuring a ligand to DNA molar ratio that is 
different from those found in monomer forming sequences 
(45). This behavior would suggest that the quadruplex 
units along a multimer structure are not completely 
independent. 

The three-dimensional arrangement of the full length 
telomeric overhang can lead the G-tetrads of not consecu- 
tive G-quadruplex units into stacking distance. This 
allows the formation of additional putative binding sites 
besides the quadruplex-quadruplex interfaces of consecu- 
tive units. The driving forces of the interactions that are 
established to these interfaces are mainly of electrostatic 
and van der Waals types, not affected by the fact that the 
facing ends are of 3 -5' 5 3 -3' or 5 '-5' type (12). Therefore, 
given the nature of these interactions, the two molecules of 
Ber, that in our experiments interact with the 5'-ends of 
two short independent G-quadruplex units (23 nucleotide 
bases), may establish in vivo stacking interactions onto 3' 
or 5' G-tetrads of non-consecutive G-quadruplexes be- 
longing to long telomeric sequences. Indeed, it has been 
proposed that telomeric DNA and RNA might afford dif- 
ferent higher-order arrangements of parallel-stranded 
G-quadruplexes (12). These include the 5-5' stacking 
mode we have experimentally observed, and also the 
3 / -3 / and the 3'-5' one. 

The presence of two Ber molecules stacked side-by-side 
onto the tetrad is due, most likely, to how small the 
molecule is compared with other ligands whose 
telomere/ligand structure is reported in literature (5,6,9). 
As observed in the Results section, the Ber-molecule gives 
rise, both in 3'- and 5'-end binding site, to an optimal 
dimensional match with a pair of guanines, so maximizing 
the Ti-stacking interaction between the aromatic moieties 
and leaving room in the binding site to lodge a second 
molecule of ligand. The really strong interaction can 
also be inferred from the coplanarity of the G5/G11 and 
G17/G23 pairs and from the modified conformation of the 
3'-end tetrad with respect to the till now reported intra- 
molecular telomeric G-quadruplex structures, at least in 
the solid state. 

The size of the Ber molecule may also explain its 
G-quadruplex over duplex selectivity (46). In fact, while 
the two Ber molecules can be easily located on the 
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telomeric tetrad, the results obtained from the X-ray dif- 
fraction analysis of the d(CGTACG)-Ber complex point 
out that this planar ligand is too wide to give rise to a 
canonical intercalation in the ds-DNA, and is forced to 
intercalate at the junction of two consecutive distorted 
double helices (25). 

Summarizing, this study presents the first X-ray struc- 
ture of the adduct formed between Ber and the telomeric 
DNA, and provides a key to interpret the observed select- 
ivity of the ligand toward the G-quadruplex compared 
with the DNA double helix. The solved structure high- 
lights a stoichiometric ratio that is different from the 
one observed in diluted solution for the adducts with the 
hybrid type folding. In particular, it shows the presence of 
two Ber molecules stacked side-by-side onto the tetrad. 
These data, together with the recent literature that con- 
siders the parallel conformation as one of the biologically 
relevant folding topology for telomeric DNA, show new 
possibilities for ligand to DNA quadruplex binding, that 
have never been considered up to now. In our opinion, 
this interaction mode provides an important clue for in- 
terpreting binding data obtained with experimental 
settings which simulate the in vivo conditions. 
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